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Abstract — Exact queueing analysis of erasure networks with 
networlf coding in a finite buffer regime is an extremely hard 
problem due to the large number of states in the network. In 
such networks, packets are lost due to either link erasures or due 
to blocking due to full buffers. In this paper, a block-by-block 
random linear network coding scheme with feedback on the 
links is selected for reliability and more importantly guaranteed 
decoding of each block. We propose a novel method that 
iteratively estimates the performance parameters of the network 
and more importantly reduces the computational complexity 
compared to the exact analysis. The proposed framework yields 
an accurate estimate of the distribution of buffer occupancies 
at the intermediate nodes using which we obtain analytical 
expressions for network throughput and delay distribution of 
a block of packets. 

I. Introduction 

In networks, packets transfer through a series of intermedi- 
ate nodes (routers). Hence, the packets may have to be stored 
at intermediate nodes for transmission at a later time, and 
often times, buffers are limited in size. Although a large buffer 
size is usually affordable and preferred to minimize packet 
drops, large buffers have an adverse effect on the latency of 
the network. Further, using larger buffer sizes at intermediate 
nodes will cause practical problems such as on-chip board 
space and increased memory-access latency. 

The problem of buffer sizing and congestion control is of 
paramount interest to router design engineers. Typical routers 
today route several tens of gigabits of data each second fTl. 
Realistic studies have shown that, at times, Internet routers 
handle about ten thousand independent streams/flows of data 
packets. With a reasonable buffer size of few Gigabytes of 
data, each stream can only be allocated a few tens of data 
packets. Therefore, at times when long parallel flows congest 
a router, the effects of such a small buffer space per flow 
come to play. This work aims at providing a framework to 
analyze the performance of single information flow in finite- 
buffer line networks and investigates the trade-offs between 
throughput, delay and buffer size. 

The problem of computing throughput capacity and de- 
signing efficient coding schemes for lossy networks has been 
widely studied S-l?). However, the study of capacity of 
networks with finite buffer sizes has been limited iSj-lSl. 
This can be attributed solely to the fact that finite buffer 
systems are analytically hard to track. In 131, ||6l, it was shown 
that min-cut capacity cannot be achieved due to the limited 
buffer constraint in a line network. They also proposed upper 



and lower bounds for the throughput capacity of wireline 
networks. Recently in |7|, authors presented an iterative 
estimation method to analyze the performance of a random 
routing scheme in general wired networks. Here, we will 
exploit similar ideas (on iterative estimation) to analyze the 
performance of block-based network coding scheme}^ which 
is a combination of both random linear coding and an ideal 
feedback. This is partly motivated by the feedback-based 
network coding scheme in ||9l, ifTOl . Further, the problem 
of finding the packet delay has been visited in queueing 
theory literature on the behavior of open tandem queues, 
which are analogous to line networks IfTTI . lfT2ll . However, the 
approaches therein cannot be used to study the performance 
of the random linear coding (RLC) scheme. 

This work is motivated by the fact that the RLC scheme 
for finite-buffer networks introduced in 1 8 1 has the limitation 
that it cannot be used to characterize the latency profile. This 
is because, the typical notion of latency is not meaningful 
for the RLC scheme. Since latency is critical for real-time 
applications (such as video streaming), a block-by-block 
encoding of the stream is required. Hence, we introduce our 
Block-based Random Linear Coding scheme, which applies 
RLC on each individual bloclfl We will see that our approach 
guarantees a certain decoding delay while this is not the case 
in RLC, which is a rate-optimal scheme with large decoding 
delay. Further, by using RLC, only one feedback is required 
for transmission of K packets which considerably limits the 
average number of feedbacks per transmission. 

This paper is organized as follows. First, we present a 
formal definition of the problem and the network model in 
Section Next, we define the network states and the need 
for an approximation of the exact problem. We investigate 
the tools and steps for finite-buffer analysis in Section |IVl 
We then obtain expressions for throughput and probability 
distribution of delay in Section[Vl Finally, Section [Vll presents 
our analytical results compared to the simulations. 



'In the network coding literature, it is also referred as generation-based 
scheme, where each intermediate node combines only those pacJcets that from 
the same generation. 

-Just as in any networlc coding scheme, the paclcets received by the 
destination are linear combinations of the original data in a block. Hence, 
with the knowledge of this linear transformation at the decoder, inversion 
can be performed to recover the data block. 
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II. Problem Statement and Network Model 

We consider line networks for our study. As illustrated in 
Fig. [T] a line network is a directed graph of h hops with 
the vertex set V ~ {vq, vi, . . . ,Vh} and the edge set 1^ = 
{{vi, Wi+i) : i = 0, ■ . ■ , h — 1} for some integer h > 2. In 
the figure, the intermediate nodes are shown by black ovals. 
The links are assumed to be unidirectional, memoryless and 
lossy. We let Si denote the packet erasure probability over the 
Unk {vi-i,Vi). The erasures model only the quality of links 
(e.g., presence of noise, interference) and do not represent 
packet drops due to finite buffers. Each node Vi G V has 
a buffer size of packets. It is assumed the destination 
node has no buffer constraints and that the source node has 
infinitely many innovative packet^. The system is analyzed 
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Fig. L An illustration of the line network. 

using a discrete-time model, where each node transmits one 
packet over a link per epoch. The unicast capacity between a 
pair of nodes is defined to be the supremum of all achievable 
rates of transmission of information packets (in packets per 
epoch) between the pair of nodes over all possible means 
used for packet generation and buffer update at intermediate 
nodes. The capacity is discussed thoroughly by the authors 
in ifTSl . However, in this work we are not aiming to analyze 
the fundamental capacity of the network using rate optimal 
schemes. A practical network coding scheme, which is well- 
suited for transmitting real-time data streams in a block-by- 
block fashion using RLC and feedback, is introduced. The 
proposed scheme guarantees certain decoding delay, which is 
not the case in the RLC. 

In the proposed practical scheme, the source node takes 
the stream of packets and divides them into blocks of K 
packets each. The buffer of each intermediate node Vi (1 V 
is then segmented into Mi blocks. In other words, we have 
mi = MiK. Each block is then served using RLC over 
all the packets in the block. The blocks are served based 
on a. first-come first-serve policy. An instant lossless hop-by- 
hop acknowledgment per block is also employed to indicate 
the successful receipt of a complete block of K packets. In 
each epoch, one or multiple of the following events occur in 
different orders: 
1. If a node neither receives any innovative packet nor 
conveys any innovative packet to the next node, then 
the content of its buffer does not chang^ 

^A received packet is called innovative with respect to a node if the packet 
cannot be generated by a linear combination of the current buffer contents 
of the node. 

'^In this work, by conveying a packet, we mean that the packet is 
successfully transmitted and stored at the next-hop node. 



2. Upon receiving an innovative packet, it will be stored 
in the last available block (a block of memory with less 
than K innovative packets stored in it). The packets of 
this block will be served after all the previously received 
blocks in the buffer are completely served. 

3. In each epoch, every node transmits a packet formed by 
the RLC encoding over its current block (oldest block in 
the queue), until the node receives an acknowledgment 
indicating that the block is fully conveyed to the next-hop 
node. The block will then be removed from the buffer 
and the next block in the queue will be served. This also 
implies that free space in the buffer will be increased by 
K packets. 

To implement the per-block feedback mechanism, a node 
must distinguish innovative packets upon their reception. One 
way is to compute the rank of the received packets in a 
block. A more practical protocol is to use a variable CMB in 
the header of each encoded packet, indicating the number of 
innovative packets used by RLC to form the packet. Further, 
every node maintains a counter INV indicating the number 
of innovative packets received in its current block so far. 
Every node sets INV = for each new incoming bloclJl and 
increments it by one for each incoming packet whose CMB 
is greater than INV of the receiving node. Note that, if the 
current block in the queue of a node has K innovative packets 
then CMB is equal to K for all the packets to be transmitted 
by that node. 

In this work, we will employ the following notations. For 
any x G [0, 1], x = 1 — x. The convolution operator is denoted 
by (g) and ®^ f is used as a shorthand for the /-fold convolution 
of / with itself. For < A < 1, G(A) denotes the probability 
mass function of a random variable that is geometric with 
mean jz^- 

III. Exact analysis and Network States 

In 13], a Markov-chain approach for exact analysis of a 
finite-buffer line network identifies the throughput as equiv- 
alent to the problem of finding the buffer occupancy distri- 
bution of the intermediate nodes. However, the size of the 
Exact Markov Chain (EMC) and the multiple reflections due 
to the finiteness of buffers at each intermediate node render 
this problem mathematically intractable for even networks 
of small hop-lengths and buffer sizes. We therefore aim to 
approximate the distribution of buffer occupancies. 

To approach this approximation problem properly, it is 
necessary to clearly define the buffer states in a manner 
that (a) an irreducible ergodic Markov chain is obtained, and 
(b) the steady-state distribution of the chain allows tractable 
expressions for the performance parameters of the network. 
Thus, a proper definition for the buffer states cannot be 
proposed unless the communication scheme is known. For the 
scheme in Sec. [Ill two variables are needed to track all the 
buffer states of a node. Let s be the total number of innovative 
(w.r.t. the next-hop node) packets stored at a node. Denote t 
to be the number of successfully conveyed innovative packets 

^If INV = K, then the counter is reset to and an acknowledgment is 
sent to the previous node. 
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by the node from the current block. Then, the pair (s, t) can 
be defined as the state of the buffer of the node. Note that s 
is the minimum number of packets that a node has to store in 
its buffer. Also note that, since a new block starts to be served 
after the /C* packet of the cuiTent block, t e {0, . . . , K -1}. 
As an example, assuming that node Vi is in state {s,t) at the 
start of the epoch, given that during the epoch it only sends a 
packet successfully but does not receive any packets, the state 
of the network will change to (s, t+1) ift = {0, 1, . . . , K—2} 
or it will change to (s — K,0) if t = K — 1. 

IV. A Framework for Finite-Buffer Analysis: 
Markov Chain Modeling 

In this section, we aim to determine the distribution of 
buffer occupancy of an intermediate node, which will later 
be used to analyze network parameters such as throughput 
and latency of a block. 

Due to the discrete-time nature of the analysis framework, 
two Markov chains need to be constructed for each inter- 
mediate node. The first one considers the buffer occupancy 
at instants when a packet has just been transmitted (either 
successfully or unsuccessfully), which is called receive-first 
Markov chain (RFMC). This is required to compute the prob- 
ability of blocking, which is caused when the state of a node 
is forced to remain unchanged because the transmitted packet 
was successfully delivered to the next-hop node, but the latter 
does not store the packet due to full buffer occupancy . The 
second one considers the buffer occupancy at instants when a 
packet has just been received/stored, which is called transmit- 
first Markov chain (TFMC). This will be used to calculate the 
incoming rate of innovative packets at each node. 

Note that the problem of exactly identifying the steady- 
state probabilities of the RFMC and TFMC suffers the same 
difficulties as identifying that of the EMC [13|. The finite 
buffer condition introduces a strong dependency of state 
update at a node on the state of the node that is downstream. 
To develop an estimation scheme that considers blocking, we 
make the following assumptions. 

1. Packets are ejected from nodes in a memoryless fashion. 
This assumption allows us to keep track of only the 
information rate. 

2. The blocking event occurs independent of the state 
of a node. This allows us to track just the blocking 
probability. 

3. At any epoch, given the occupancy of a particular node, 
packet arrival and blocking events are independent of 
each other. 

These assumptions spread the effect of blocking equally 
over all non-zero states of occupancy at each node. Now, 
given that the arrival rate of innovative packets at node Vi is r.i 
packets/epoch, and that the probability of the next node being 
full, (i.e., s — m,i+i for node fi+i) is Pbi+i, we can show that 
transition dynamics of the state change for node Vi is given 
by the Markov chain depicted in Fig. |2] for both TFMC and 
RFMC 0. Also, obtaining the state transition probabilities is 

*SeIf-Ioops are not demonstrated in the figure. 




Fig. 2. The general structure for both TFMC and RFMC for a node with 
buffer size m = MK. 



Straightforward using r^, Pbi+i, £i and Ei+i. As an example, 
for TFMC we have the following 

{Ti S 
n{ei+i +e.^+ipb,+i) s - 1 > t . (1) 
Otherwise 

Note that we notate s and t instead of Si and ti for the 
simplicity of notation when considering node Vi. The same 
transition probability for RFMC is different from TFMC and 
is given by 

P"s''t)^{s+ut) = r,{e^+i + e^+lPb,+l). (2) 

Note that, both ([Hi and Q are valid for s — 
{0,1,..., MiK - 1} and i = {0, 1, . . . , -ft: - 1}. 

For all input parameters, the Markov chains can be shown 
to be aperiodic, irreducible and ergodic. Therefore, it pos- 
sesses a unique steady-state distribution. The steady state 
probability of node Vi being in state (s,t), is denoted by 
Pf^(s,t) and P[^is,t) for RFMC and TFMC, respectively. 

The blocking probability that the node Vi-i perceives from 
the node Vi is the same as the probability of Vi being full 
(s = MiK) at the instant when a packet is transmitted suc- 
cessfully by the previous node. Hence, this probability have 
to be calculated using the steady state probability distribution 
of RFMC as follows 

A'-l 

t=o 

Similarly, the steady state probability distribution of TFMC 
can be used to compute the arrival rate at the next node using 

A'-l 

r,+i = (l- (4) 
t=o 

Note that if a node is in the state (i, t) (t £ {0,. . .,K - 
1}), it means that it has stored t innovative packets from the 
current block so far and it has also sent t linear combinations 
of them successfully. Therefore, there is no more innovative 
packets to send. 

Finally, Given two vectors r = {ri,...,rfi) G [0,1]'' 
and pb = {Pbi,---,Pbh) G [0,1]'', we term (r,pb) as an 
approximate solution to the exact problem, if they satisfy 
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(|3]l and (|4|l in addition to having ri = ei and pi,,^ = 0. 
Fortunately, the resuh from our previous work on the capacity 
of Hne networks in |13| can be appHed to this setup as well 
and guarantees both the uniqueness and an algorithm for 
identifying the approximate solution. 

The approximate solution is obtained iteratively by the 
following procedure: 

1. Initialization: r = (ei, . . . ,eh) and pb = (0, . . . , 0) 

2. Construct TFMC and RFMC respectively and compute 
their steady-state probabilities. 

3. Using Pl^''{s,t) and Pr{s,t) (obtained from step 2) 
calculate the new values for r and pb by ^ and 
(IHl, respectively, for i = l,2,...,ft, — 1 and auxiliary 
equations ri = 1 — ei and pb,j = 0. 

4. Repeat steps 2 and 3 until all the distributions converge. 

V. Computation of Network Parameters 

In this section, we exploit the results of the iterative 
estimation of buffer occupancy distributions in Sec. |IV] to 
obtain analytical expressions for both network throughput and 
delay distribution of a block. The Block Delay is defined as 
the time taken for a block of K information packets (at the 
source) to be transferred through a line network from the 
instant when the first packet of that block is transmitted from 
the source node to the instant when the if* innovative packet 
of that block is received by the destination node (i.e., the block 
can be decoded at the destination). 

Given a line network with Unk erasures f = (ei, . . . , e/j), 
intermediate node buffer sizes M. = (mi, ... , rrih-i), we can 
find the approximate solution (r, pb). Using this, an estimate 
of the throughput is obtained using the following 

Ci£,M,K)=rh{l-pbh)=rh. (5) 

To compute the distribution of the total block delay, one 
can proceed in a hop-by-hop fashion in the following way: 

D = Ti ® Wi ® W2 . . . ® W^_2 ® F, (6) 

where Ti is the probability distribution of the time taken 
for a packet in the source to be conveyed to node vi . Further, 
Wi is the probability distribution of the time taken from the 
instant when node Vi stores the first innovative packet of a 
block to the instant the first packet of the corresponding block 
in Vi is conveyed to node u^+i. Finally F is the probability 
distribution of the time taken for all the K packets of a 
block in node Vh-i to be conveyed to the destination node 
from the instant when the first innovative packet of the same 
block is stored in the buffer of node Vh-i- Thus, using the 
definition, we have Ti = G(ei), where e'l is the effective 
erasure probability (after considering blocking) and is given 
by 

-^(e,+pb^s^ i = l,2,...,h-l 

' \ Sh i = h ■ ^ ' 

Also, the average waiting time in node Vi is formulated as 

A/,-1 A'-l 

W, = x,(0,0)S,(0,0)+ J2 ^ 7r,(di^,t)S,(di^,t), (8) 
d=l t=o 



where, TTi{s,t) is the probability that an arriving packet 
finds node Vi in state (s, t) given that it is the first packet 
of its corresponding block. Also, Si{s,t) is the probability 
distribution of the time taken for the first innovative packet 
of a block in Vi to be conveyed to node w^+i from the instant 
when the first innovative packet of that block arrives at node 
Vi and finds its buffer at state {s,t). 

If an arriving packet is the first of its corresponding block, it 
finds the buffer at states of the form {dK, t) where d can take 
any value between and Mi — 1. This is because of the fact 
that the last block had been completely served before the first 
packet of the current block arrives. Hence, both Si{s,t) and 
TTi{s, t) will be if s is not a multiple of K. Finally, 7ri(s, t) 
can be formulated as follows for n G {0, 1, ... , Mi — 1}, 

Ms,t)^\ eST^^tc^ , (9) 

[ Otherwise 

where $f^(s) is the marginal probability distribution of s for 
an arriving packet (i.e., The probability that an arriving packet 
finds the buffer of node Vi in the states of the form (s, .)). $i 
can be computed as follows 

niin{K — l,s} 

P^'i^'t)■ (10) 

Also, Si{s,t) can be derived using the following relation 

for n G {0, 1, . . .,M, - 1} 

[ Otherwise 

Finally, F can be calculated by taking the average over all 
the conditional delay distributions L(s,i) as 

nih-l -1 A'-l 

F = 7r^_i(0,0)L(0,0)+ J2 7r„_i(di^,t)L(dif,0, 

d=l t=o 

where L(s,t) is the probability distribution of the time taken 
for the whole block to be conveyed to the destination node 
given that the first packet of that block found the buffer 
of node Vh-i in state (s,t) when arrived. Note that after 
receiving the first packet of a block, node Vh-i has to wait 
until all its previously stored blocks are conveyed to the desti- 
nation, during which some of the packets of the corresponding 
block might have already been arrived. Let V(a;, y) be the 
probability distribution of the time to convey y innovative 
packets to the next node when x of those packets (x < y) has 
yet to arrive for the same block, knowing that a packet departs 
with probability Pout and an innovative packet arrives with 
probability Pin in each time epoch. Hence, L(s,i) is derived 
using the following relation for n G {0, 1, . . . , Mi — 1} 

r V{K-1,K) s = 

L{s,t)^l {«)"("'*)G(£,0}+V(i^-/3,i^) s^nK , 
[ otherwise 

where a{n,t) = {n — \)K + {K — t) is the number of 
packets that have to leave node before the next block 
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to be served and 13 = mm{K — 1, 



a(n,t)Pi„ 
Po^t 



} is the 



expected number of packets from the corresponding block 
that arrived during the time when those Q;(n, packets were 
being conveyed. Further, Pin = - Y.t=o 0) 

for h > 2, Pin = Eh-i for h = 2, and Pout = £h- V(a;, y) 
will be determined by the following lemma. 

Lemma 1: Y{x,y) (defined for x < y) is the solution to 
the following equation, 

V(.,y) = [PiV(^^ j^ - 1) +P2V(. - hy) +p,^rix -l.y~ 



Pi + P2 + P3 



with boundary conditions: 



V(0,y) 
V{x,x) 



(g>y(G{i - Pout) 

G(l-P„0®V(a;-l,x) 



where. 



Pi = Poutil - -Pm) 
7^3 — Pi7iPout 



P2 
Pi 



Pin (l Pout) 



(11) 



(12) 



{l~P,n){l~Pout). 

Proof: The proof is omitted due to space limitations. ■ 

VI. Results of Simulation 

In this section, we compare our analytical results to the 
actual simulations. To study the effect of buffer size on 
throughput and block delay, we simulated a line network 
of eight hops for two cases where all the links have the 
same probability of erasure of 0.1 or 0.2. The buffer size 
m (in packets) is divided into M blocks of K packets. Fig. [3] 
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Fig. 3. Delay and thi'oughput of an 8-hop line network as a function of m. 

presents the variation of our analytical results and the actual 
simulations for both throughput and average delay of a block, 
as the buffer size m of the intermediate nodes is varied 
while the block size is fixed at = 5 packets. It can be 
seen that as the buffer size is increased, average delay also 
increases linearly. It appears that above memory sizes of 10, 
the gain in capacity is negligible, while the latency increases 
significantly. Hence, there is no need to allocate more storage 
to the flow even if the space is available in the router. Further, 
for buffer sizes of less than 10 packets, there is a gap from 
the min-cut capacity, a diverging point from asymptotic results 
due to the finite buffer effect. Fig. |4] presents a comparison 
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Fig. 4. Delay profiles of a 5-hop line network for varying buffer sizes 
{K = 4). 



between the actual and the estimated delay profile for a five- 
hop line network with the erasure probability on every link 
set to 0.05. It also compares the delay profiles for different 
buffer sizes when X = 4. It is noticed that as the buffer size 
of the intermediate nodes is increased, both average delay and 
its standard deviation are increased. This is undesirable since 
any increase in the standard deviation of the delay can make 
congestion control algorithms unstable. 
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